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ABSTRACT 

The gramnar which SHROLU uses to parse sentences is outlined in a 
series of flowcharts which attempt to modularize and illuminate its 
structure. In addition, a short discussion of systemic granmar is 
included. 
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The purpose of these flowcharts is to melee available to the general) 
t^on-KHRDIU-hecking public, the parser which iiHHDLU uses, There have beer, 
rany who have tried to decipher its code; most have either become 
lopelessly entangled or gainfully niace their way through its web. Now, 
it last, ever you car; have in your heme a complete set of SIIKDLU 
flowcharts, besides raking the cede generally more comprehensible, the 
lloweharts de-eiojhasJze the linearity of the parking program and instead 
trganize it into modules. Thus the reader can see the outline of a large 
»art of the jrocess at a fairly high level arid only later turn to another 
iage to yonder the details. Hopefully, their availability will encourage: 
ether system implementers to use the rairser as a front end 3rd will spark 
Lome crors— cultural communication between the non- computing linguistic 
community an<- AI language workers* 

'ihete fJowclarti* are best read with a copy of WinogradV. thesis or 
look by your side. Although a few characteristics of sy£teaic\*rammfcr 
*re rentionee below, its basic philosophy is not detailec here. Neither 
is the design and operation of PhOCRAMMAI:, the language in which the 
parser ia written. Working through the FROGRAMMAR ejenples in the 
thesis(pcge 153 in the thesis version or page 98 in /the took) is a good 
jrelircinary exercise in following the operation of tt*e grarmar. Of 
course, the charts are also leant to be a guide for reading the code; 
they explicate the crucial section*;, avoiding sore of the kludges which 
t-akc the grautar difficult to decipher. You should not expect, however, 
that the correspondence tetween the flow charts and either other source 
vill be perfect. The thesis explicates e more ccrcplete franuratical 
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theory than that which the program en- bodies; in addition, feature names 
Lometines diifer radically* While the charts stick u.ore closely to the 
» rc,^raj;. than to the thesis, several Lections of code have been omitted 
md/cr "cisrepresented, w most notably the backtracking sections 
^precisely because they vre uostly ftc hoc). Don't be thrown by &ucb 
i.ifferences* You are net missin/* anything or bein# stupid; they art: 
(hopefully) intentional* 

A XHW WUKLS ABOUT SYSOEMIC GRAtiKAh 

ioris vs, Function 

A bteic diclotory which sycteiaic gnurjnar considers is that letween 
the form and function of a particular unit* Fort refers to the internal 
; tructurc of a ui.it. what constituents it car: have* how they must he 
Mranfed arid what dependencies exist araonn them. Function censioers a 
unit's place in the "higher order of things 11 : what roles it can plaj in 
higher ccnstituer*ts end what dependencies it enters into* For example, 
the form of en IiC may be represented as several r.on-indejendent lIqU. 
which need tc be filled; the KG pro/vam is primarily concerned with 
j illin^ those slots &nd cjnassinc features (e.p> number and person)* Its 
: unction is indicatec by the places in other programs from which it is 
called: j*s the object of a preposition, first or second object ol a vert, 
subject of a clause, constituent of an A£JG(e,£. as big as the tell) etc. 

Tht form oi a unit is evident in the flowcharts, detailing its 
yrogram; » unit's functions **re indicated by the places where an attenpt 



to wrse it is made (i.e. whrre /unitX * c found) There ore clearly n 
multitude of functions lor a unit, i.eraerber that there are also many ana 
varied fcrrcs. For example, an HC containing "anything" has © different 
jorr; froc the garden- variety filled-clot 1JC end the for© of t VC cannot 
io easily detailed in a slot-and-filler account, though s prc^ratt does a 
I ood ;job of describing its fcrm. 

Of course, a constituent's function may affect its font. A primary 
use of features in this parser is to communicate such effects up and dour: 
the tree. CLAUSE ma\ call HC with the initial feature OLJ, which 
; pecifiee objective case for any pronoun the UG ray come up with. J*C f in 
turn, ma,\ caJl CIAUSI with the initial features KSMG (rank-shifted noun 
;rouj.) 310 (as it. " Ijgftgiafl Bulgarian dances is great fun. n ), which allows 
the clause tc have no subject or a pessecsive subject and reouircs it to 
have an II-C verb rroup. /. ^ocd thin/* to focus on in reading the 
jlowcharts is the effect of such features. Often a jrorjam will, a£> it:; 
iirst action, dispatch to pieces of <.ode which know how such functional 
ieatures affect form. 

I ore on leatures 

Uhile we're on the subject, nore about the features a systemic 
,rarj.ar uses. 3c far we've teen referring tc their syntactic relevance 
t_nd how they direct the rarsiry- process. One of systemic grammar's 
clairs tc faiie is that fcatuics have both syntactic Mid :;eEar;tic 
t ipjxii ic&nce, thus psrtinlly bridrin/ the rai between syntax and 
Lemantics which has Veen recently so laocntcti. lor cxacile, the feature 
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LilF en ar, IJC not onl;. constrains its form but also tells us that a 
specific object cr objects are bein/* referred to (e.f . the hsppy smiling 
rabbit) The feature IHT, Riven to clauses with intensive verbs such as 
1£, tells us structurally to expect i*n IIC V AUG cr PliEFC acting es a 
complement alter the vert but also indicates that the coinplerent will be 
i, property of the subject of the sentence as in "The rabbit is cute." 
Mmilarly, an K3C ( rank-: ~h if ted qualifier) clause has a definite 
i tructure in wheh one of its constituents wiJl be the neun it ic 
] odifyiru * It also rives us the serr;oitic information thet the object 
icinr roierred tc by the entire Mi roust satisfy the semantic constraints 
ef the ftLC. linally, a TIME J.C has a very definite structure and dual 
contain i tire wcrd such as "yesterday* 1 or "uonth" (us in "lust tonth"); 
it has otvious secantic significance in that we hnow it affects the time 
referred to ty the utterance, which woulc otherwise fce deterrineti solely 
by the tense - 

The flowcharts indicate when sore of the more important features are 
i ddec to nodes. (Koticc that one way features are added is by sending 
initial features alonr in the attempt to parse) There are, in addition, 
eecision joints base*: on testing lor a particular feature - the rest 
relevant of these are also netcd in the flowcharts* host feature nar.es 
trc cither found in the thesis or are inter pre tcble froa their epellin,^ 
with a little thcurfit- TOOK, for example, is the feature oi a verb 
which tabes vn HiAXi 'iC as its second object cs "I asked bin to scratch n; 
Lack." lit this sentence, "hir-", an HC, is the first object, while tU 
CLAUSE SShG TO "to scratch my back" is the second. 1RAHSINT refers to n 
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verb which Ukes an object ard a cow* leiaent as "I consider him art idiot." 
/. niost important feature, RKI -NCG-FOi IiD, is irarked on an RSG clause which 
las not vet cetermined which of its constituents' places its REUJ£AD (the 
i oun it is modify in?) will tske. 

Thic treatment tf features implies some uniformity in their use; 
jctu^lly feature.* serve three distinct purposes in this parser. The fort 
(fa />iv< n constituent is clearly incicated fcy its features; its function 
is also co rourkec, although this information is soroevhat redundant with 
the contents of the registers* (sec t* low) All of the features relating 
to function coule be easily ferived usinf the registers and £ome simjle 
\ roe-traversing. A third, very different use? of features is as temporary 
variafclet during the pairing process- RiL-KOT-FCUHD, for example, iz 

enly temporarily att&chcc : to a clause and is not a feature referring to 
cither function cr form which rernin-r on a node at the end of the parsinf 
jroccss. Such features :-erve to direct the parsing process tut are not 
mi end result of it- 

i emitter*. 

Certain of the subrorts of a cor.stituent will always be crucial to 
its :;eniG2 tic analysis; such subparts are kept in registers sc they will 
ie readily avail; fcle to the semantic specialists as well as later 
eyntactic processes. The mu;e of a register is in general an obvious 
five-awaj 1 for its roesninf: SUBJECT, OU1 v IXXaCAIr-SUBJP;CT,(in passive 
: <mtence;:) , J Mh(vnin vcrl). 1IKA1/ i:- the register used to hold tic head 
j oun of :n }X- or tho hew. nd.icctivn of an AUJtt; IhLIilAll Lold: the HO 
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which an UiiCJ modifies am; is used by the USQ CLALSL jroffrarc when it lookc 
i round ftr its missing constituent. Such registers ndd no theorcticcl 
1 ower to the rar^r, since a constituent can always te found by moving 
*- round the perse tree, but they add to the eese of program writing* 
reading end c*ccuit:entation; they ere another way which shows up clearly in 
Ihesc flowcharts for constituent prop rase to corouunicate information 
j nonf themselves* 

OPtfLATIOh 01 TIE PARSHl 

As explained in iron detail in the thesis, the parser works, as do 
i est sue] parser:., by buiUdir:/* a par^e tree* Each node is imrked as to 
features, daughters, parent, worths it contains and relevant registers. 
;,emai:ticr are initially present only at the word level; such definitions 
fro i torcd in the dictionary- At crucial points in the framrar (retrains 
(which on iidicrtcd on the flowcharts as "ChhCK SEMANTICS 11 ) semntic 
tpecialifts #re called. ITcy combine the semantics: of the comjoner.to into 
i sei.nntic structure for that cart oi the unit parsed so far. At lerst 
me sonar-tic specialist exists for etch syntactic unit. KG* for example, 
loa&tfi two: one which handler* that ntrt cf the KC through the noun, end 
the second which worries about the semantics of qualifiers occurring 
i f ter the noun* Such a specialist win also report that it's not hapry 
vith the coclination of words the parser came up with for this 
constituent; its coraplaint urually cruses the unit to por off itf la£t* 
parsed constituent or, it; moie radical cases, to threw up its hands end 
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return a failure *essa£a to the program which called it- This semantic 
guidance prevents an attempt to parse "The i*irl pave the hoy plants to 
water 11 aiiala^ ously tc "The rirl fave the house plants; to charity." i;nd 
is one of the more interesting features cf the parsing stratery. 

IlsOEKU VUt RbAEU.G TIE flOVCHARIS 

Lacl na^e ol the flowcharts is titled; in addition, papes are 
j Jibe] led with <nr>ire cf unit> / <rare within that unit> and such 
< Coif nations aro usee on other rf.re;; to refer to thir section of the 
flowcharts, t.c* kS/1 or CL/10, 




*Lft*£L' 



r*LftH^L+ 



.fifchL 



FALSE unit with piven features (eouivalent to c call tc 
PAUSE) 



FALSE this particular word (equivalent to c call to 
BMi&E with hIL as the first arfumer-t) 



LAIJlL in gramnar pro/rrrBs, loc&l to the pafe it appear:; 
on. 

LAJ KL in grammar programs. The coce at this label ia 
accessed from sore other p&fte in the flowcharts. 

TiAI l*X in granaar prograins. The code at this l&bel is 
detailed on the pafe indicated under the lrbel* 
(Arolofies for the confusing closeness of this notation 
anc : the one above it*} 



Three I ranches leading froir an attempt to lAliSt 
taker: if attempt succeeds 
taken if attempt fails 
taken if i«rse :n;ccords x*nd there* are; no nsorc woids 
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left in tht sentence 



Drench point* relevant test is detailed inside Oiaxond- 



A large taek which is detailed on another ia ; <^ headed 
by thitt naxe and designated by unit/ number below* 
These iray le considered the primary modules of the 
prr nimar. 



A small ta- ( k f not detailed anywhere else because what 
it say£ here should be enoyph for you to Act the xtiea* 
Jtoie details any be found in the actual cooe if you're 
interested. 
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Features bein^ checked for are in capital letters and 
underlined. Unless otherwise noted* features referred 
to belong to the constituent currently teirp parsed. 



* 11 example oi 

: OP*0 text 



r relevant piece of on utterance which- to be 

parked, reuses through this particular point in tht 
pror.r«t - The underlined constituent is the part which 
i:; particularly relevant, often whf t would be parsed by 
the necrcct call to PAIi>E. 



FWt 



The prcerar called to j,arcc this constituent returns a 
failure mer sa^e. 



*R£TSM*r 



is the usui-1 label for the place in. each urdt's pro/^raji. 
where final semantics uork is done and the program 
returns its parsed constituent which is then attached 
to the tree. 



In general, the flowcharts parallel the code well enough so that the 
correspondences ere cbvious* In one case, this not true* In the 
ilowchartf: fcr finding the first and second object in a claure, □ great 
real of the work is checHnr feature: of the vert anu, should tht parse 
<r :* suitable object succeed, aarkinj the clnufte suitably and setting an 
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jrppropricte re^i^Lor. This work is "hidcen" in the subroutine CAhTAhSE t 
those ar/urcerti; tre: number cf object (1 or £); unit to ]arse if the vcrl 
is arprorriatcly r-arke<i; feature with which to rark clause if p^rse 
succeeds* 
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